skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Search for: All records

Creators/Authors contains: "Clay, David_E"

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

  1. Abstract Preharvest yield estimates can be used for harvest planning, marketing, and prescribing in‐season fertilizer and pesticide applications. One approach that is being widely tested is the use of machine learning (ML) or artificial intelligence (AI) algorithms to estimate yields. However, one barrier to the adoption of this approach is that ML/AI algorithms behave as a black block. An alternative approach is to create an algorithm using Bayesian statistics. In Bayesian statistics, prior information is used to help create the algorithm. However, algorithms based on Bayesian statistics are not often computationally efficient. The objective of the current study was to compare the accuracy and computational efficiency of four Bayesian models that used different assumptions to reduce the execution time. In this paper, the Bayesian multiple linear regression (BLR), Bayesian spatial, Bayesian skewed spatial regression, and the Bayesian nearest neighbor Gaussian process (NNGP) models were compared with ML non‐Bayesian random forest model. In this analysis, soybean (Glycine max) yields were the response variable (y), and spaced‐based blue, green, red, and near‐infrared reflectance that was measured with the PlanetScope satellite were the predictor (x). Among the models tested, the Bayesian (NNGP;R2‐testing = 0.485) model, which captures the short‐range correlation, outperformed the (BLR;R2‐testing = 0.02), Bayesian spatial regression (SRM;R2‐testing = 0.087), and Bayesian skewed spatial regression (sSRM;R2‐testing = 0.236) models. However, associated with improved accuracy was an increase in run time from 534 s for the BLR model to 2047 s for the NNGP model. These data show that relatively accurate within‐field yield estimates can be obtained without sacrificing computational efficiency and that the coefficients have biological meaning. However, all Bayesian models had lowerR2values and higher execution times than the random forest model. 
    more » « less
  2. Abstract Because the manual counting of soybean (Glycine max) plants, pods, and seeds/pods is unsuitable for soybean yield predictions, alternative methods are desired. Therefore, the objective was to determine if satellite remote sensing‐based artificial intelligence (AI) models could be used to predict soybean yield. In the study, multiple remote sensing‐based AI models were developed for soybean growth stage ranging from VE/VC (plant emergence) to R6/R7 (full seed to beginning maturity). The ability of the deep neural network (DNN), support vector machine (SVM), random forest (RF), least absolute shrinkage and selection operator (LASSO), and AdaBoost to predict soybean yield, based on blue, green, red, and near‐infrared reflectance data collected by the PlanetScope satellite at six growth stages, was determined. Remote sensing and soybean yield monitor data from three different fields in 2 years (2019 and 2021) were aggregated into 24,282 grid cells that had the dimensions of 10 m by 10 m. A comparison across models showed that the DNN outperformed the other models. Moreover, as crops matured from VE/VC to R4/R5, theR2value of the models increased from 0.26 to over 0.70. These findings indicate that remote sensing data collected at different growth stages can be combined for soybean yield predictions. Moreover, additional work needs to be conducted to assess the model's ability to predict soybean yield with vegetation indices (VIs) data for fields not used to train the model. 
    more » « less
  3. Abstract Cover crops improve soil health and reduce the risk of soil erosion. However, their impact on the carbon dioxide equivalence (CO2e) is unknown. Therefore, the objective of this 2‐yr study was to quantify the effect of cover crop‐induced differences in soil moisture, temperature, organic C, and microorganisms on CO2e, and to develop machine learning algorithms that predict daily N2O–N and CO2–C emissions. The prediction models tested were multiple linear regression, partial least square regression, support vector machine, random forest (RF), and artificial neural network. Models’ performance was accessed using R2, RMSE and mean of absolute value of error. Rye (Secale cerealeL.) was dormant seeded in mid‐October, and in the following spring it was terminated at corn's (Zea maysL.) V4 growth stage. Soil temperature, moisture, and N2O–N and CO2–C emissions were measured near continuously from soil thaw to harvest in 2019 and 2020. Prior to termination, the cover crop decreased N2O–N emissions by 34% (p = .05), and over the entire season, N2O–N emissions from cover crop and no cover crop treatments were similar (p = .71). Based on N2O–N and CO2–C emissions over the entire season and the estimated fixed cover crop‐C remaining in the soil, the partial CO2ewere −1,061 and 496 kg CO2eha–1in the cover crop and no cover crop treatments, respectively. The RF algorithm explained more of the daily N2O–N (73%) and CO2–C (85%) emissions variability during validation than the other models. Across models, the most important variables were temperature and the amount of cover crop‐C added to the soil. 
    more » « less